library(labelled)
library(codebook)
library(tidyverse)
knitr::opts_chunk$set(
  warning = FALSE, # show warnings during codebook generation
  message = FALSE, # show messages during codebook generation
  error = TRUE, # do not interrupt codebook generation in case of errors,
                # usually makes debugging easier, and sometimes half a codebook
                # is better than none
  echo = FALSE  # don't show the R code
)
ggplot2::theme_set(ggplot2::theme_bw())

Load data

Codebook

Metadata

Description

Dataset name: vcs

The dataset has N=2241 rows and 22 columns. 969 rows have no missing values on any column.

Metadata for search engines

  • Date published: 2020-07-10
x
voice_id
ID
dataset
sex
age
f0
f1
f2
f3
f4
pf
neuro
extra
openn
agree
consc
dominance
behavior
attitude
desire
soi_full
sex_c

#Variables

voice_id

Distribution

Distribution of values for voice_id

Distribution of values for voice_id

0 missing values.

Summary statistics

name data_type n_missing complete_rate min median max mean sd hist label
voice_id numeric 0 1 1 1121 2241 1121 647.0653 ▇▇▇▇▇ NA

ID

Distribution

Distribution of values for ID

Distribution of values for ID

539 missing values.

Summary statistics

name data_type n_missing complete_rate min median max mean sd hist label
ID numeric 539 0.7594824 1 1264 9.5e+10 56101998 2296931472 ▇▁▁▁▁ NA

dataset

Distribution

Distribution of values for dataset

Distribution of values for dataset

0 missing values.

Summary statistics

name data_type n_missing complete_rate min median max mean sd hist label
dataset numeric 0 1 1 4 11 4.531905 2.864653 ▇▃▃▂▁ NA

sex

Sex

Distribution

Distribution of values for sex

Distribution of values for sex

0 missing values.

Summary statistics

name label data_type n_missing complete_rate min median max mean sd n_value_labels hist
sex Sex haven_labelled 0 1 -1 -1 1 -0.1726908 0.9851959 2 ▇▁▁▁▁▁▁▆

Value labels

Response choices
name value
male 1
female -1

age

Age

Distribution

Distribution of values for age

Distribution of values for age

137 missing values.

Summary statistics

name label data_type n_missing complete_rate min median max mean sd hist
age Age numeric 137 0.9388666 18 23 56 24.452 6.138718 ▇▂▁▁▁

f0

Voice pitch

Distribution

Distribution of values for f0

Distribution of values for f0

7 missing values.

Summary statistics

name label data_type n_missing complete_rate min median max mean sd hist
f0 Voice pitch numeric 7 0.9968764 75 192 303 172.2847 50.53207 ▇▃▇▇▁

f1

Distribution

Distribution of values for f1

Distribution of values for f1

4 missing values.

Summary statistics

name data_type n_missing complete_rate min median max mean sd hist label
f1 numeric 4 0.9982151 -2.8 0.013 5.1 0 1 ▂▇▅▁▁ NA

f2

Distribution

Distribution of values for f2

Distribution of values for f2

4 missing values.

Summary statistics

name data_type n_missing complete_rate min median max mean sd hist label
f2 numeric 4 0.9982151 -4.7 0.13 3.3 0 1 ▁▁▇▇▁ NA

f3

Distribution

Distribution of values for f3

Distribution of values for f3

4 missing values.

Summary statistics

name data_type n_missing complete_rate min median max mean sd hist label
f3 numeric 4 0.9982151 -3.8 0.12 3.2 0 1 ▁▅▇▇▁ NA

f4

Distribution

Distribution of values for f4

Distribution of values for f4

4 missing values.

Summary statistics

name data_type n_missing complete_rate min median max mean sd hist label
f4 numeric 4 0.9982151 -2.8 0.19 2.7 0 1 ▁▇▅▇▁ NA

pf

Formants

Distribution

Distribution of values for pf

Distribution of values for pf

4 missing values.

Summary statistics

name label data_type n_missing complete_rate min median max mean sd hist
pf Formants numeric 4 0.9982151 -2.7 0.081 3.2 0 0.7932869 ▁▅▇▂▁

neuro

Neuroticism

Distribution

Distribution of values for neuro

Distribution of values for neuro

802 missing values.

Summary statistics

name label data_type n_missing complete_rate min median max mean sd hist
neuro Neuroticism numeric 802 0.6421241 -2.1 -0.25 1.8 -0.192606 0.7633565 ▁▆▇▅▂

extra

Extraversion

Distribution

Distribution of values for extra

Distribution of values for extra

803 missing values.

Summary statistics

name label data_type n_missing complete_rate min median max mean sd hist
extra Extraversion numeric 803 0.6416778 -2.5 0.5 2 0.4468985 0.7401981 ▁▁▅▇▃

openn

Openness

Distribution

Distribution of values for openn

Distribution of values for openn

802 missing values.

Summary statistics

name label data_type n_missing complete_rate min median max mean sd hist
openn Openness numeric 802 0.6421241 -1.6 0.8 2 0.7602201 0.5969557 ▁▁▅▇▃

agree

Agreeableness

Distribution

Distribution of values for agree

Distribution of values for agree

802 missing values.

Summary statistics

name label data_type n_missing complete_rate min median max mean sd hist
agree Agreeableness numeric 802 0.6421241 -2.1 0.62 1.9 0.583538 0.6197164 ▁▁▅▇▃

consc

Conscientiousness

Distribution

Distribution of values for consc

Distribution of values for consc

802 missing values.

Summary statistics

name label data_type n_missing complete_rate min median max mean sd hist
consc Conscientiousness numeric 802 0.6421241 -2.1 0.56 2 0.4977692 0.6936437 ▁▂▆▇▃

dominance

Dominance

Distribution

Distribution of values for dominance

Distribution of values for dominance

1256 missing values.

Summary statistics

name label data_type n_missing complete_rate min median max mean sd hist
dominance Dominance numeric 1256 0.4395359 -1.9 0.4 2 0.3785218 0.5866429 ▁▂▇▇▂

behavior

Distribution

Distribution of values for behavior

Distribution of values for behavior

242 missing values.

Summary statistics

name data_type n_missing complete_rate min median max mean sd hist label
behavior numeric 242 0.8920125 -2.2 -1.1 2.2 -0.9135086 1.055427 ▇▅▃▂▁ NA

attitude

Distribution

Distribution of values for attitude

Distribution of values for attitude

239 missing values.

Summary statistics

name data_type n_missing complete_rate min median max mean sd hist label
attitude numeric 239 0.8933512 -2.2 0.56 2.2 0.3653193 1.218307 ▃▅▆▇▆ NA

desire

Distribution

Distribution of values for desire

Distribution of values for desire

238 missing values.

Summary statistics

name data_type n_missing complete_rate min median max mean sd hist label
desire numeric 238 0.8937974 -2.2 -0.19 2.2 -0.1796899 1.044242 ▃▇▇▆▂ NA

soi_full

Unrestricted sociosexuality

Distribution

Distribution of values for soi_full

Distribution of values for soi_full

243 missing values.

Summary statistics

name label data_type n_missing complete_rate min median max mean sd hist
soi_full Unrestricted sociosexuality numeric 243 0.8915663 -2.2 -0.22 2 -0.24281 0.8572349 ▂▆▇▅▁

sex_c

Distribution

Distribution of values for sex_c

Distribution of values for sex_c

0 missing values.

Summary statistics

name data_type n_missing complete_rate min median max mean sd hist label
sex_c numeric 0 1 -1 -1 1 -0.1726908 0.9851959 ▇▁▁▁▆ NA

Missingness report

Codebook table

JSON-LD metadata The following JSON-LD can be found by search engines, if you share this codebook publicly on the web.

{
  "name": "vcs",
  "datePublished": "2020-07-10",
  "description": "The dataset has N=2241 rows and 22 columns.\n969 rows have no missing values on any column.\n\n\n## Table of variables\nThis table contains variable names, labels, and number of missing values.\nSee the complete codebook for more.\n\n|name      |label                       | n_missing|\n|:---------|:---------------------------|---------:|\n|voice_id  |NA                          |         0|\n|ID        |NA                          |       539|\n|dataset   |NA                          |         0|\n|sex       |Sex                         |         0|\n|age       |Age                         |       137|\n|f0        |Voice pitch                 |         7|\n|f1        |NA                          |         4|\n|f2        |NA                          |         4|\n|f3        |NA                          |         4|\n|f4        |NA                          |         4|\n|pf        |Formants                    |         4|\n|neuro     |Neuroticism                 |       802|\n|extra     |Extraversion                |       803|\n|openn     |Openness                    |       802|\n|agree     |Agreeableness               |       802|\n|consc     |Conscientiousness           |       802|\n|dominance |Dominance                   |      1256|\n|behavior  |NA                          |       242|\n|attitude  |NA                          |       239|\n|desire    |NA                          |       238|\n|soi_full  |Unrestricted sociosexuality |       243|\n|sex_c     |NA                          |         0|\n\n### Note\nThis dataset was automatically described using the [codebook R package](https://rubenarslan.github.io/codebook/) (version 0.9.3).",
  "keywords": ["voice_id", "ID", "dataset", "sex", "age", "f0", "f1", "f2", "f3", "f4", "pf", "neuro", "extra", "openn", "agree", "consc", "dominance", "behavior", "attitude", "desire", "soi_full", "sex_c"],
  "@context": "http://schema.org/",
  "@type": "Dataset",
  "variableMeasured": [
    {
      "name": "voice_id",
      "@type": "propertyValue"
    },
    {
      "name": "ID",
      "@type": "propertyValue"
    },
    {
      "name": "dataset",
      "@type": "propertyValue"
    },
    {
      "name": "sex",
      "description": "Sex",
      "value": "1. male,\n-1. female",
      "maxValue": 1,
      "minValue": -1,
      "@type": "propertyValue"
    },
    {
      "name": "age",
      "description": "Age",
      "@type": "propertyValue"
    },
    {
      "name": "f0",
      "description": "Voice pitch",
      "@type": "propertyValue"
    },
    {
      "name": "f1",
      "@type": "propertyValue"
    },
    {
      "name": "f2",
      "@type": "propertyValue"
    },
    {
      "name": "f3",
      "@type": "propertyValue"
    },
    {
      "name": "f4",
      "@type": "propertyValue"
    },
    {
      "name": "pf",
      "description": "Formants",
      "@type": "propertyValue"
    },
    {
      "name": "neuro",
      "description": "Neuroticism",
      "@type": "propertyValue"
    },
    {
      "name": "extra",
      "description": "Extraversion",
      "@type": "propertyValue"
    },
    {
      "name": "openn",
      "description": "Openness",
      "@type": "propertyValue"
    },
    {
      "name": "agree",
      "description": "Agreeableness",
      "@type": "propertyValue"
    },
    {
      "name": "consc",
      "description": "Conscientiousness",
      "@type": "propertyValue"
    },
    {
      "name": "dominance",
      "description": "Dominance",
      "@type": "propertyValue"
    },
    {
      "name": "behavior",
      "@type": "propertyValue"
    },
    {
      "name": "attitude",
      "@type": "propertyValue"
    },
    {
      "name": "desire",
      "@type": "propertyValue"
    },
    {
      "name": "soi_full",
      "description": "Unrestricted sociosexuality",
      "@type": "propertyValue"
    },
    {
      "name": "sex_c",
      "@type": "propertyValue"
    }
  ]
}`

Gender differences

Distributions by dataset